(19) 



Europaisches Patentamt 
European Patent Office 
Office europeen des brevets 



(12) 



(43) Date of publication: 

25.03.1998 Bulletin 1998/13 



(n) EP 0 831 596 A2 

EUROPEAN PATENT APPLICATION 

(51) int. CI. 6 : H04B 1/66 



(21) Application number: 97115447.1 

(22) Date of filing: 06.09.1997 



(84) Designated Contracting States: 


(72) Inventors: 


AT BE CH DE DK ES Fl FR GB GR IE IT LI LU MC 


• Schrdder, Ernst F. 


NLPTSE 


30655 Hannover (DE) 


(30) Priority: 20.09.1996 DE 19638546 


• Spille, Jens 
30966 Hemmingen (DE) 


(71) Applicant: 

DEUTSCHE THOMSON-BRANDT GMBH 
78048 Villingen-Schwenningen (DE) 


(74) Representative: 

Hartnaek, Wolfgang, Dipl.-lng. et al 
Deutsche Thomson-Brandt GmbH 
Licensing & Intellectual Property, 
Gottinger Chaussee 76 
30453 Hannover (DE) 



(54) Method and circuit arrangement for coding or decoding audio signals 



(57) In audio source coding, knowledge from the 
field of psycho-acoustics is utilized in order to reduce 
the data rate. The characteristic properties of psycho- 
acoustics are highly dependent on the sound pressure 
level of the signal. A specific audible loudness is there- 
fore normally assumed. This can lead to an impairment 
of the signal quality if the signal is subsequently inten- 
sively processed. Thus, the quantization noise N can 
become perceptible if, on the receiver side, the signal S 
is enhanced with the aid of a dynamic compression cir- 



cuit VCA1, or if a signal compression VCA2 is carried 
out before the coding ENC2 and a signal expansion 
VCA3 is carried out after the decoding DEC2. In order 
to avoid this impairment of the signal, the psycho- 
acoustic characteristic values which are relevant to the 
coding and decoding are selected as a function of the 
audible loudness resulting from the dynamic compres- 
sion and possible dynamic expansion. 



vuai > 



Fig.3 



EP 0 831 596 A2 



Description 

The invention relates to a method and a circuit 
arrangement for coding or decoding audio signals, psy- 
cho-acoustic effects being utilized and dynamic com- 
pression being performed. 



In audio source coding, knowledge about human 
hearing which has been gained from the field of psycho- 
acoustics is utilized in order to reduce the data rate. In 
this case, the redundancy and irrelevance contained in 
the audio signal, that is to say the information which is 
superfluous or not perceptible in this context, is utilized 
in particular. During the irrelevance reduction, on the 
one hand irrelevant signal components are not coded 
and hence not transmitted. On the other hand, distor- 
tions which are produced by slight quantization are tol- 
erated if they are not registered by the hearing because 
of masking effects. Masking is understood here to mean 
a psycho-acoustic behaviour of the hearing in which a 
sound with a low sound level is masked by an adjacent 
sound with a high sound level. This can be utilized for 
data reduction, in that the masked signal components 
are separated from those components which are signif- 
icant for the human hearing impression. The coding 
then takes into account only those components which 
are relevant for hearing. However, the coding of the sig- 
nals above the audibility threshold can then be carried 
out using low quantization, so that the quantization 
noise which is produced still lies below the audibility 
threshold. 

Masking is utilized, for example, in the MUSICAM 
method. In the case of MUSICAM, the audio signal is 
split up into 32 partial bands, the audibility threshold 
being calculated for each partial band. In this case, both 
the masking is taken into account and a resulting quan- 
tization noise is determined from the quiescent hearinq 
threshold. 



The invention is based on the object of specifying a 
method to improve the signal quality in the reproduction 
of audio signals. This object is achieved by the method 
specified in Claim 1. 

The invention is based on the further object of spec- 
ifying a circuit arrangement to apply the method accord- 
ing to the invention. This object is achieved by the circuit 
arrangements specified in Claims 4 and 7. 

The characteristic properties of psycho-acoustics 
are highly dependent on the sound pressure level of the 
signal. A specific audible loudness is therefore normally 
assumed. This can lead to an impairment of the signal 
quality if the signal is subsequently intensively proc- 
essed, for example using a dynamic compressor or 
dynamic expander. 



If an audio encoder according to the prior art is fed 
with a signal S (here a sinusoidal signal) of low loud- 
ness (e.g. 60 dB) for coding, then the quantization noise 
N is set such that the latter lies somewhat below the qui- 
5 escent hearing threshold T1 of about 0 dB (see Fig. 1 A). 
If, on the receiver side at the output of the decoder, the 
signal S is then enhanced by 15 dB. for example, with 
the aid of a dynamic compression circuit, then the quan- 
tization noise N becomes perceptible, as illustrated in 
w Fig. 1 B. If, on the other hand, an audio signal is com- 
pressed in the studio before coding and subsequently 
expanded once more on the receiver side, then the 
dependence of the masking curve on the loudness 
leads to an impairment of the signal. As illustrated in 
15 Fig. 2, the masking curve T2 changes its slope as a 
function of the loudness. Thus, it is flatter, in the higher- 
frequency range, for high loudnesses (see Fig. 2A) than 
for low loudnesses (see Fig. 2B), that is to say the 
human ear is more sensitive to high sounds at low loud- 
20 nesses than at high loudnesses. If a compressed loud 
signal S is coded, then it is possible to permit a compare 
atively high quantization noise N. However, if this signal 
is expanded after the decoding, then the actual masking 
curve in the human ear becomes significantly steeper, 
25 and the quantization noise becomes perceptible as a 
result. In order to avoid this impairment of the signal, it 
would indeed be possible, in general, to proceed from a 
quiescent hearing threshold with a distinctly lower level 
or from a distinctly lower sound pressure, but this would 
30 lead to a distinctly higher data rate. 

In principle, therefore, the method according to the 
invention for coding and decoding audio signals, in 
which psycho-acoustic effects are utilized and dynamic 
compression is performed, consists in the fact that, dur- 
35 ing the coding, a psycho-acoustic characteristic value or 
a plurality of psycho-acoustic characteristic values is or 
are selected as a function of the dynamic compression, 
and that the decoding is carried out using these charac- 
teristic values. 

to On the decoder side, the dynamic compression is 
advantageously performed after the decoding. The psy- 
cho-acoustic characteristic value is then preferably the 
level of the quiescent hearing threshold. 

It is likewise advantageous, on the decoder side, to 
45 perform dynamic expansion of the signals compressed 
on the encoder side. The psycho-acoustic characteristic 
value is then preferably the slope of the masking curve. 

In principle, the circuit arrangement according to 
the invention for encoding comprises an encoder for 
so coding audio signals, in which encoder psycho-acoustic 
effects are utilized, and a unit for obtaining a compres- 
sion factor which is vised for a dynamic compression of 
the audio signals, the encoder being fed, by the unit for 
obtaining the compression factor, with a compression- 
55 factor signal on which a psycho-acoustic characteristic 
value in the coding in the encoder depends. 

Preferably, the compression-factor signal contains 
information about the current loudness level and/or 
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about the current compression factor. 

Advantageously, the audio signal is fed to the unit 
for obtaining the compression factor and the compres- 
sion-factor signal is additionally fed to a dynamic com- 
pressor which compresses the audio signal before the 
coding, the compressed audio signal then being fed to 
the audio encoder. 

In principle, the circuit arrangement according to 
the invention for decoding comprises a decoder for 
decoding audio signals which were coded utilizing psy- 
cho-acoustic effects, and a unit for changing the 
dynamic range of the audio signal, the unit for changing; 
the dynamic range being fed a compression-factor sig- 
nal on which a psyche-acoustic characteristic value in 
the coding in the encoder depended, the change in the 
dynamic range on the decoder side being a function of 
the compression-factor signal supplied. 

ft is advantageous here if a unit for obtaining the 
compression factor is fed with the decoded audio sig- 
nals, and a compression-factor signal generated by this 
unit is fed to the unit for changing the dynamic range, 
the latter unit performing dynamic compression of the 
decoded audio signals. 

It is likewise advantageous if a compression-factor 
signal generated on the encoder side is fed to the unit 
for changing the dynamic range, the latter unit perform- 
ing dynamic expansion of the audio signals compressed 
on the encoder side. 

Drawings 

Exemplary embodiments of the invention are 
described using the drawings, in which: 

Fig. 1 shows an audio signal after decoding, with 
the masking threshold and quantization 
noise, before (A) and after (B) dynamic 
expansion; 

Fig. 2 shows an audio signal after decoding, with 
the quiescent hearing threshold and quanti- 
zation noise, before (A) and after (B) 
dynamic compression; 

Fig. 3 shows the method according to the invention 
using a dynamic compressor; 

Fig. 4 shows the method according to the invention 
using a dynamic expander. 

Exemplary embodiments 

The method according to the invention with 
dynamic compression on the receiver side is illustrated 
in Fig. 3. Such a method is of interest, in particular, in 
the reproduction of audio signals by simple reproduction 
devices or car radios. In such cases, the restricted proc- 
essable dynamic range or ambient noise makes it expe- 
dient to enhance quiet signal components of the audio 
signal in order to make these audible. For this, the audio 
signal SE is first fed on the transmitter side to a unit 



COMP for obtaining a compression factor and to the 
actual audio encoder ENC1. The coding of the audio 
signals SE is then carried out in the audio encoder 
ENC1, taking into account the psycho-acoustic effects 
s described above. In addition, the encoder contains a 
separate input LT1, via which the encoder is fed, by the 
unit COMPE for obtaining the compression factor, with 
information about the current loudness level, in order to 
determine the current quiescent hearing threshold. The 
10 coded signal is then transmitted and, on the receiver 
side, decoded once more by the audio decoder DEC1, 
this decoder containing the reciprocal functions of the 
coder. The decoded signal is then compressed by the 
compressor VCA1, the compressor being fed with the 
is compression factor by the unit COMPD for obtaining the 
compression factor. As a result, the compressed audio 
signal SD is, finally, available. 

That method according to the invention in which 
dynamic expansion is earned out on the receiver side is 
20 illustrated in Fig. 4. In this oase, in order to improve the 
signal reproduction, the dynamic compression is 
already carried out on the encoder or transmitter side. 
Whereas simple receivers then only reproduce the com- 
pressed signal, a hi-fi receiver carries out an expansion 
25 of the compressed signal, with the result that the audio 
signal can then be reproduced over the complete 
dynamic range. Here too. the audio signal SE is first fed 
to a unit COMP for obtaining the compression factor. 
The compression factor determined is then fed to the 
30 compressor VCA2, which compresses the audio signals 
SE before the coding. The compressed signal is then 
fed to the audio encoder ENC2, to which the compres- 
sion factor is also fed via the input LT2. in order to have 
a measure of the current loudness level. This makes it 
35 possible, depending on the loudness, to select the 
masking threshold curve which is optimal in each case. 
The audio signal is then appropriately coded and 
recorded or transmitted together with the compression 
factor. In the case of a broadcast transmission, the com- 
40 pression factor can either be transmitted jointly with 
other radio data, with the aid of an additional data chan- 
nel, for example in accordance with the European RDS 
or American RBDS standard, or can be inserted inaudi- 
bly into the actual audio signal. The latter provides the 
45 advantage that the compression factors are always 
transmitted synchronously with the sound signal, so that 
these compressin factors are also still available follow- 
ing a sound recording. For the purpose of reproduction, 
an expansion by the dynamic expander VCA3 is then 
so carried out after the decoding of the transmitted audio 
signal by the audio decoder DEC2. the said dynamic 
expander being' fed with the transmitted compression 
factor for this purpose. The expanded audio signal SD is 
thus available for further processing or reproduction. 
55 The invention can be used for the transmission of 
digital audio signals by transmission systems such as. 
for example, DAB or ADR, but also for the recording and 
reproduction of digital audio signals by, for example, 
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DVD or DVC devices. 
Claims 

1. Method for coding or decoding audio signals, in s 
which psycho-acoustic effects are utilized and 
dynamic compression is performed, characterized 
in that, during the coding, a psycho-acoustic char- 
acteristic value or a plurality of psycho-acoustic 
characteristic values (T1 , T2) is or are selected as a it 
function of the dynamic compression, and in that 
the decoding can be carried out using these char- 
acteristic values. 

Method according to Claim 1 , characterized in that, is 
on the decoder side, the dynamic compression is 
performed after the decoding. 

t. Method according to Claim 1 or 2, characterized in 
that the psycho-acoustic characteristic value is the so 
level of the quiescent hearing threshold (T1 ). 

- Method according to Claim 1 , characterized in that, 
on the encoder side, the dynamic compression is 
carried out before the coding, and dynamic expan- 25 
sion is carried out after the decoding. 

. Method according to Claim 1 or 4. characterized in 
that the psycho-acoustic characteristic value is the 
slope of the masking curve (T2). 30 

Circuit arrangement having an encoder (ENC1, 
ENC2) for coding audio signals (SE), in which 
encoder psycho-acoustic effects are utilized, and 
having a unit (COMP, COMPE) for obtaining a com- 35 
pression factor which is used for a dynamic com- 
pression of the audio signals on the encoder or 
decoder side, characterized in that the encoder is 
fed, by the unit for obtaining the compression factor, 
with a compression-factor signal (CF, CFE) on to 
which a psycho-acoustic characteristic value in the 
coding in the encoder depends. 

Circuit arrangement according to Claim 6, charac- 
terized in that the compression-factor signal (CF, 45 
CFE) contains information about the current loud- 
ness level and/or about the current compression 
factor. 



9. Circuit arrangement having a decoder (DEC1, 
DEC2) for decoding audio signals which have been 
coded utilizing psycho-acoustic effects, and having 
a unit (VCA1, VCA3) for changing the dynamic 
range of the audio signal, characterized in that the 
unit for changing the dynamic range is fed with a 
compression-factor signal (CF, CFD) on which a 
psycho-acoustic characteristic value in the coding 
in the encoder depended, the change in the 
1 dynamic range on the decoder side being a function 
of the compression-factor signal supplied. 

10. Circuit arrangement according to Claim 9, charac- 
terized in that a unit (COMPD) for obtaining the 
compression factor is fed with the decoded audio 
signals, and a compression-factor signal (CFD) 
generated by this unit is fed to the unit (VCA1) for 
changing the dynamic range/the latter unit per- 
forming dynamic compression of the decoded 
audio signals. 

11. Circuit arrangement according to Claim 9. charac- 
terized in that a compression-factor signal {CF) 
generated on the encoder side is fed to the unit 
(VCA3) for changing the dynamic range, the latter 
unit performing dynamic expansion of the audio sig- 
nals compressed on the encoder side. 



Circuit arrangement according to Claim 6 or 7, so 
characterized in that the audio signal (SE) is fed to 
the unit (COMP. COMPE) for obtaining the com- 
pression factor, the compression-factor signal (CF) 
determined from the audio signal being additionally 
fed to a dynamic compressor (VCA2) which com- 55 
presses the audio signal (SE) before the coding, 
the compressed audio signal then being fed to the 
audio encoder (ENC2). 
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(54) Method and circuit arrangement for coding or decoding audio signals 



(57) In audio source coding, knowledge from the 
field of psycho-acoustics is utilized in order to reduce 
the data rate. The characteristic properties of psycho- 
acoustics are highly dependent on the sound pressure 
level of the signal. A specific audible loudness is there- 
fore normally assumed. This can lead to an impairment 
of the signal quality if the signal is subsequently inten- 
sively processed. Thus, the quantization noise N can 
become perceptible if, on the receiver side, the signal S 
is enhanced with the aid of a dynamic compression cir- 



cuit VCA1 , or if a signal compression VCA2 is carried 
out before the coding ENC2 and a signal expansion 
VCA3 is carried out after the decoding DEC2. In order 
to avoid this impairment of the signal, the psycho- 
acoustic characteristic values which are relevant to the 
coding and decoding are selected as a function of the 
audible loudness resulting from the dynamic compres- 
sion and possible dynamic expansion. 
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